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Design and Chemical Synthesis of a Neoprotein Structural Model for the 
Cytoplasmic Domain of a Multisubunit Cell-Surface Receptor: Integrin aii:.^: 

(Platelet GPIIb-IIIay 
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The Scripps Research fnsiitute. 10666 North Torrey Pines Road. La Jolla. California 9203' 
Receiced December 10. 1993; Revised Mamiscript Received April 4, 1994^ 

ABSTRACT- Integrins are a class of heierodimeric cell adhesion receptors involved in cell migration, cci: 
anchoraae. and cell-cell interactions. The cytoplasmic domains of iniegrms are ot key ^rnportance m these 
activities We have designed and chemically synthesized a 1 26 ammo acid model protein (MP- ) contamin^c 
both cvioplasmic tails of the platelet-derived integrin au,^i covalently linked via a helical coiled coil. Tn. 
coiled-coil tertiary structure was incorporated to mimic the membrane-spanning domain of the mtegrin ano 
to act as a topological constraint fixing the two cytoplasmic tails in a parallel arrangement. This molecule 
which contains two C-termini, was constructed by chemical dovetailing. The bromoacetylated and cystein> I 
peptide synthons were unambiguously -ligated through the formation of a thioether linkage. Lltraviolc: 
circular dichroism (CD) spectroscopy has been performed on MP-l and related compounds, confirmin^c 
that a helical coiled coil is present within the MP- 1 molecule. Significantly, the helicity apparently extenas 
beyond the predicted amphiphilic region of MP-1. Fluorescence measurements suggest that a defined 
tertiarv structure has formed bv the association of the two cytoplasmic domains. We conclude that this 
is a practical design strategy for the study of the cytoplasmic domain of multisubunit cell-surface receptors. 



Many integral membrane proteins transmit signals across 
the plasma membrane. Technical difficulties have greatly 
limited the application of high-resolution techniques to the 
structures of these proteins. Indeed molecular structures are 
available for only two intact transmembrane proteins, a 
bacterial pholoreaction center (Deisenhofer et al., 1985) and 
a porin (Weiss et ah, 1990). Structures of receptor extra- 
cellular domains have been determined using soluble truncated 
extracellular domains as models (De Vos et al., 1 992; Milburn 
et al., 1991). These structures have contributed greatly to 
our understanding of the basis of ligand recognition, but have 
provided less insight into the mechanism of signal transduction. 
Many membrane proteins that transduce signals are members 
of the type I transmembrane protein family, the defining 
feature of which is a single membrane spanning region. These 
include the T cell receptor (Weiss, 1993), growth factor 
receptors (Patthy, 1990), and cytokine receptors (Miyajima 
et al., 1992). In general the cytoplasmic domain of these 
proteins is critical for signaling. Thus, to understand signal 
transduction through such receptors, it is essential to under- 
stand the structure and function of the cytoplasmic domain. 
This is especially difficult for multisubunit type I receptors. 
To address this issue we have developed a novel strategy for 
the study of the cytoplasmic domain of multisubunit receptors 
and applied this to the study of a member of the integrin 
family. 

Integrins are heterodimeric transmembrane receptors found 
in almost all cell types [for review, see Hynes (1992)]. Each 
subunit (a or .J) has a single membrane spanning domain and 
a cytoplasmic extension. Integrin heterodimers are centrally 
involved in many processes involving bidirectional transmission 
of signals across the plasma membrane (Hynes, 1992). One 
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of the most intensively studied members of the integrin ran:iil> 
is the platelet-derived integrin anb^3- This integrin has beer- 
reported to function through both outside-in and inside-ou: 
signaling, and its cytoplasmic domain has been impH».Mied ir. 
both processes (Ginsberg etal., 1992;Sastry & Horwii/. 1993; 
Structure/function studies on an in vitro model v^' th': 
cytoplasmic domain of cotild provide a more cohesive 
understanding of signal transduction through this rc^'wn o: 
the protein. 

The inherent hydrophobicity of membrane-spanning 
stretches of apolar amino acids means that structural studie-- 
on truncated receptors containing these helical regions will be 
impeded by insolubility in aqueous solution. In the preser/- 
study, we introduce a new approach to overcome this solubility 
problem by replacing the hydrophobic membrane-spanning 
sequence with a highly soluble amphiphilic peptide designed 
to mimic the helical secondary structure of the native 
membrane-spanning domain." The membrane-spanning seg- 
ments are linked together covalently in the correct orieniatior. 
by means of chemical dovetailing. This novel strategy, which 
makes use of advances in both the de novo design and the 
chemical synthesis of proteins, has been used to construct >- 
model protein designed to mimic the structural properties of 
the cytoplasmic domains of the prototypical platelet integrir. 
am^3. In this paper we describe a general strategy for the 
design and chemical synthesis of synthetic model protein*, 
which will allow structural and functional information to be 
obtained on the cytoplasmic domains of multichain receptor-. 

EXPERIMENTAL PROCEDURES 

Materials. Boc-amino acids* and HBTU were ubiainc'i 
from Novabiochem (San Diego, CA). Preloaded Boe-aminr/- 
acyl-0CH2-Pam-resins and diisopropylethylaminc (DIEA; 
were purchased from Applied Biosystems (Foster City, CA; 
4-MethylbenzhydryIaminc-resin (Lot No. 023863) was pur- 
chased from Peninsula Labs (San Carlos, CA). N,N-0\- 
methylformamide was purchased from Mallinckrodi C hcmica 
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Co. HPLC-gradc acetonitrile was purchased from EM 
Science (Gibbstown, N J). All other reagents were AR grade 
from Aldrich Chemical Co. 

Peptide Synthesis. All peptides were synthesized manually 
according to our in situ neutralization/HBTU activation 
protocol for Boc solid-phase chemistry as described previously 
(Schnolzeretal., 1992b). Coupling yields were monitored by 
the quantitative ninhydrin determination of residual free amine 
(Sarinetal., 1981). Peptide a-carboxamides were constructed 
on a 4-methylbenzhydrylamine-resin, and all other peptides 
weri synthesized on appropriate Boc-aminoacyl-OCH2-Pam- 
resins. Where required, the bromoacetyl group was introduced 
at the N"-ierminal of a peptide by coupling as the preformed 
symmetric anhydride. In all cases, side-chain protecting 
a'roups were removed and the peptides cleaved from the resin 
bv treatment with liquid HF containing 47c p-anisole, for I 
h at 0 ''C. Crude peptide products were precipitated and 
washed with diethyl ether before being dissolved in aqueous 
acetic acid (10-30%) and lyophilized. 

Reierse-Phase HPLC. Analytical and semipreparative 
gradient HPLC were performed on a Rainin dual pump high- 
pressure mixing system with 2l4-nm detection. Semi- 
preparative HPLC was run on a Vydac Cis column (10 ^tm, 
10X 250 mm) at a flow rate of 3 mL/ 

was performed on a Whatman Cis column (5 ^m* 4.0 X 140 
mm) at a flow rate of 1 mL/min. Preparative HPLC was 
performed on a Waters Prep 4000 system fitted with a Waters 
486 tunable absorbance detector. Preparative HPLC was 
run on a Vydac Cu column (15-20 Mm, 50 x 250 mm) at a 
flow rate of 30 mL/min. All runs used linear gradients of 
90% acetonitrile plus 0.1% TFA versus 0.1% aqueous TFA. 

Peptide Purification and Characterization. Crude peptides 
were dissolved in aqueous acetonitrile containing 0.1% TFA 
and purified by either semipreparative or preparative HPLC. 
Peptides containing cysteine residues were dissolved in HPLC 
buffers containing 5 mM dithiothreitol. All purified peptides 
were characterized by ion-spray mass spectrometry (see Table 

I). 

Mass Spectrometry. Ion-spray mass spectrometry of crude 
and purified peptide segments was performed on an API-Ill 
quadrupole ion-spray instrument (Sciex, Toronto, Ontario) 
as described (Schnolzer 1 992a). Expected masses of peptide 
targets were determined using MacProMass (Terri Lee, City 
of Hope, Duarte, CA). 

Synthesis of Model Protein MP- 1 . ' Ligation was initiated 
by combining Cys-helix-anb (5.2 mg, 0.9 ^imo\) and BrAc- 
helix-i33 (5.5 mg, 0.6 /xmol) in 1.1 mLof95% DMF/5% 0.1 
M sodium phosphate, pH 7.0 containing 8 M urea at 25 **C. 
The ligation reaction was monitored by reverse-phase HPLC 
(20-50% CH3CN over 30 min) and by ion-spray mass 
spectrometry. The ligation reaction was terminated after 12 
h and the product purified by preparative HPLC using a 
gradient of 30-607o CH3CN over 60 min. The lyophilized 
purified product was characterized by ion-spray mass spec- 
trometry [calcd 14 189.3 (monoisotopic), 14 198.9 (average 
isotope composition), found 14 194.2 ± 2.5). 

Synthesisof Model Protein MP'2. Cys-at!b (3-01 mg, 3.16 
^mol) and BrAc-j33 (1 1.78 mg, 2.1 ^tc\o\) were combined in 



* Abbcviations: Boc, rerr-butylo.xycarbonyi; Br Ac. bromoacetyl; CD. 
circular dichroism; DMF, yVA-dimcthylformamide; HBTU, 1-hydroxy- 
bcnzotriazole tctramcthyluronium hcxafluorophosphaie; HPLC. high- 
performance liquid chromatography; M P- 1 . model protein 1 ; MP-2, model 
protein 2; NMR. nuclear magnetic resonance; standard lUPAC single- 
letter abbreviations for amino acids arc used throughout; TFA. irifluo- 
roacctic acid; UV, ultraviolet. 



Table 1: Mass Spectromctric Characterization of Synthetic Peptides 

calcd 



peptide 


(monoisotopic) 


(av isotope 
composition) 


found 


rtUb 


2434.2 


2435.6 . 


2434,0 ± 0.5 


Cys-allb 


2537.2 


2538.7 


2537.0 ±0.9 


hclix-«Ub 


5452.8 


5456.1 


5454.3 ± l.l 


Cvs-helix-allb 


5555.8 


5559.: 


5558.2 ± 1.4 




5572.5 


5577.2 


5576.1 ±0.9 


Br.Ac-j^^j 


5693.8 


569S.: 


5697.3 ± 1.4 


BrAc-helix-.::?3 


8713.5 


371;^ 5 


ii7i5.4± 1.6 


Br.\c-helix 


3157.6 


3160.-: 


3158.3 ±0.6 


Cys-helix 


3140.7 


3142.6 


3I4L2 ±0.5 



1.98 mL of 9570 DMF/O.l M sodium phosphate, pH 7.0 at 
25 The ligation reaction was monitored by reverse-phase 
HPLC using a gradient of 20-50'^c CH3CN over 30 min. 
Peaks were collected and analyzed by ion-spray mass 
spectrometry. The ligation reaction was terminated after 18 
h and the product purified by preparative HPLC using a 
gradient of 20-50% CH3CN over 60 min. The lyophilized 
purified product was characterized by ion-spray mass spec- 
trometry [calcd 8151.1 (monoisotopic), 8156.9 (average 
isotope composition), found 8151.4 ± 1.0]. 

Synthesis of Helix Dimer, Ligation w as carried out by 
combining BrAc-[G-(K-L-E.A.L.E.G)4]-NH2 (16.2 mg, 5.3 
Mmol) and Cys-(G-(K.L.E-A-L<E-G)4]-NH: (13.5 mg, 4.3 
Mmol) in 1.5 mL of 95% DMF/O.l M sodium phosphate, pH 
7.0 at 25 ^C. The ligation reaction was monitored by reverse- 
phase HPLC (5-mL aliquots) using a gradient of 35-50% CH3- 
CN over 30 min. Peaks were collected and analyzed by ion- 
spray mass spectrometry. The ligation reaction was thus 
observed to be complete after 2.5 h and the product purified 
by semipreparative HPLC using a gradient of 35-50% CH3- 
CN over 30 min. The lyophilized purified product was 
characterized by ion-spray mass spectrometry [calcd 6219.0 
(monoisotopic), 6223.0 (average isotope composition), found 
6219.4 ± 0.8]. 

Ultraviolet Circular Dichroism Spectroscopy. Far-U V CD 
spectra were recorded on an AVIV 60DS spectropolarimeter 
linked to an AT&T computer, CD spectra are presented as 
a plot of mean molar ellipticity per residue {[8], deg cm- 
dmol'O versus wavelength in 0.5-nm increments. The digitized 
data was plotted using the Cricket Graph program on a 
Macintosh Ilsi computer. All peptide and protein samples 
were dissolved in 50 mM boric acid at pH 7:0 and their 
concentrations determined by quantitative amino acid analysis. 

Calculation of Protein Helicity. The percentage of helical 
secondary structure within a sample was estimated using eq 
1. The maximal theoretical ellipticity at 222 nm [^jmax was 
determined using relationship 2 where n is the number of 
residues per chain (Chen et al, 1974). 

% helix =[0]2,2/[^],,,x 100 (I) 

[^]max = -39500[l -(2.57//i)l ((^]. deg cm" dmoP*) (2) 

Size Exclusion Chromatography. Chromatography was 
performed on a Pharmacia FPLC system using a Superdex 
75 column [240 mm X 12 mm (i.d.)]. Peptide and protein 
samples ( 1 00 mL of 50 mM solution) were eluted at a flow rate 
of 0.5 mL/min with O.l M phosphate buffer at pH 7.0 
containing 0.5 M NaCl. Sample elution was monitored either 
by absorbance at 214 nm or by fiuorescencc at 335 nm 
(excitation at 280 nm) . A series of peptides of varying length 
derived from interleukin-8 and the fibroneciin tenth type III 
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Figure I: (A) Schematic representation of the presumed topology 
of the native integrin anb^j molecule. (B) Schematic representation 
of the designed model protein MP-1. 

module, as well as the anb and .J3 tails, were used as molecular 
weight standards. 

Fluorescence Quenching Studies. Protein fluorescence was 
measured on a Jasco FP-777 spectrofluorometer. The 
quenching buffer comprised 50 mM NH4OAC at pH 6.0 
containing 0. 1 6 M KCl and 0. 1 mM S^iSiOi. The KI stock 
solution ( 1 .5 M) w as prepared gravimetrically using quenching 
buffer. Protein stock solutions (typically 100 mM) were 
similarly prepared with quenching buffer. For each data point 
in the quenching experiment 200 of protein stock was 
added to the appropriate amount of KI stock and the final 
volume made to I mL with quenching buffer. All solutions 
were incubated for I h at 25 °C before measurements were 
taken. Fluorescence emission was monitored at 25 °C at 350 
nm with excitation at 278 nm (£m/ Ex slit width = 10 nm) and 
expressed as /F where and Fare the fluorescence of the 
protein in the absence and presence of quencher, respectively. 
The data are presented as direct Stern-Volmer plots of F°/F 
vs quencher concentration. Stern-Volmer constants Kq were 
calculated using eq 3. 



P/F= I +A:Q[r] 



(3) 



RESULTS AND DISCUSSION 



Model Protein Design, In designing a model protein for 
the cytoplasmic and transmembrane domains of the 
receptor, various structural constraints were considered. The 
cytoplasmic tails of the a\\h0z receptor emerge from the plasma 
membrane at defined points, resulting in the shorter ornb tail 
being staggered relative to the longer ji?3 tail (sec Figure I). 
Moreover, there is evidence to suggest that the anb and (3^ 
cytoplasmic tails interact and that this interaction is critical 
for proper function (Ylanneetal., 1993). Such a quaternary 
interaction suggests that the cytoplasmic tails are located close 
to one another in the integrin heterodimer ( Figure I ). Finally, 
there is indirect evidence to suggest that the transmembrane 
helices in au^ff^ interact within the membrane (Frachet et al., 
1992; Bennet et al., 1 993). and it is possible that the structure 
of the cytoplasmic domain itself is affected by the conformation 
of the membrane-spanning domain of a\\\fiy To ensure that 
each of the above properties is present within our model protein. 
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Figure 2: Amino acid sequences of the synthetic polypeptide 

targets: Helix dimer, Cvs-G-lK-L-E-A-L-E-GOiCHj-G-CK-L-E-A- 
L-E-GOv. MP-2, a„b-Cys-SCHr.i?]; MP- 1 . aub-helix-Cys-SCHrhelix- 

we have taken advantage of recent advances both in de novo 
design of peptides (Monera et al., 1993) and in the chemical 
synthesis of proteins (Muir & Kent, 1993). Innovations in 
these areas have allowed the rational design of the 1 26 amino 
acid model protein (MP- 1) shown in Figure 1. 

The MP-1 synthetic protein contains helical stretches 
designed to mimic the distinct membrane-spanning domains 
of the two polypeptide chains in the receptor molecule, one 
attached to each of the cytoplasmic tails from the anb and 33 
subunits. The sequences of the anb and 1^3 tails correspond 
to residues 989-1008 (Prandini et al., 1988) and 716-762 
(Fitzgerald et al., 1987), respectively (Figure 2). Note that 
there is evidence of post-translational side-chain amidation of 
the C-terminal glutamic acid in the anb tail; therefore, a 
glutamine residue was incorporated at this point in the synthetic 
sequence (Calvete et al., 1990). The sequence designed to 
mimic the membrane-spanning region assumes a helical 
conformation in these regions of the native molecule 
(Deisenhofer et al., 1985). We have sought to imitate the 
effects of this membrane-spanning structure on the attached 
cytoplasmic tails by the use of a coiled-coil structure, suitably 
modified to provide solubility under aqueous conditions. Each 
chain of the coiled coil is composed of a 29 amino acid 
amphiphilic sequence and is itself made up of four tandem 
repeats of a seven- residue core peptide [G'(K-L-E'A*L-E'G)4] . 
This particular sequence is derived from the prototypical coiled- 
coil protein tropomyosin (Sodek et al., 1978) and is known to 
form helical secondary structure in aqueous solution (Lau et 
al., 1984). Furthermore, synthetic peptides containing poly- 
meric assemblies of this seven-residue core sequence are known 
to adopt coiled-coil tertiary and quaternary structures in 
aqueous systems (Zhou et al., 1992). 

The stability of these coiled coils is largely a result of strong 
interchain hydrophobic interactions. between leucine residues 
intheseven-residuerepeat(Lauetal., 1984), By incorporating 
two of these amphiphilic segments into MP-l we meet the 
absolute requirement for helicity, but not at the price of 
insolubility as might be the case if the natural hydrophobic 
membrane-spanning sequence were used. Moreover, the 
tendency of these helical elements to associate to form coiled 
coils may better mimic the proximity of transmembrane helices 
in the natural system and also ensures that a defined topology 
is maintained between the a and j3 cytoplasmic tails. In other 
words, the coiled coil acts as a structural template onto which 
the cytoplasmic domain of the integrin can be attached. This 
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Figure 3: Synthesis of the model protein, MP-1, by chemical 
dovetailing. Schematic equation illustrating the intended reaction 
between Cys-helix-aub and BrAc-hclix-d). 

ensures that the two cytoplasmic tails are staggered with 
respect to one another in a manner which approximates the 
intact protein (see Figure 1). Helical coiled-coil structures 
have previously been used as templates for the presentation 
of a small peptide motif (Engel et ah, 199 1 ). To better gauge 
the structural impact of the coiled-coil template we also 
designed a series of reference compounds. These included a 
second model protein, designated MP-2, containing the 
cytoplasmic tails of the aw^&i receptor linked together in a 
head-to-head manner. This molecule contains no coiled-coil 
structural template. The individual cytoplasmic tails of the 
two subunits, on their own and with the additional amphiphilic 
sequence on their N-termini, were also deemed useful control 
molecules, as were the head-to-head linked coiled-coil segments 
themselves. The primary structures of all the component 
peptides and the assemblies of them are shown in Figure 2. 

Synthetic Strategy. The model proteins illustrated in 
Figures 1 and 2 have a somewhat unusual architecture, namely, 
two C-iermini. This feature renders them not directly 
accessible via either recombinant technology or standard 
chemical polypeptide synthesis. Clearly, a novel strategy must 
be employed. This synthetic challenge can be met by using 
the chemoselective ligation principle developed in this labora- 
tory (Schnolzer & Kent, 1992). This strategy simply involves 
the chemical dovetailing of two fully unprotected peptide 
segments by a chemoselective ligation reaction to form the 
mature target compound. The selectivity of this ligation 
reaction is imposed by incorporating unique, mutually reactive 
groups, one within each peptide segment to be joined. 

In the case of MP- 1, each half of the target molecule was 
individually constructed, and then these two intermediates 
were joined together through the N-termini of the helices. In 
this case, the ligation chemistry takes advantage of the fact 
that neither cytoplasmic tail of the auh^y receptor contains 
a cysteine residue. Thus by including a unique cysteine resi- 
due with its nucleophilic sulfhydryl at the N-lerminus of one 
half of MP- 1 and an electrophilic bromoacetyl unit at the 
N-terminus of the other half we can chemically dovetail the 
two pieces in the desired manner (see Figure 3). The same 
. principle applies to the synthesis of the control protein. MP-2, 
in which the two cytoplasmic tails are directly linked together. 

Optimum Ligation Conditions, The feasibility of this 
approach was tested by studying the chemical ligation of two 
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Figure 4: Chemical dovetailing of two peptide segments by means 
of a thioether bond. Study of reaction conditions using the pro- 
(coiled coil) peptides as model segments: (a) aqueous solution (0. 1 
M phosphate at pH 7.0); note high level of disulfide-linkcd peptide 
homodimer and relatively low yield of ligation product; (b) DMF- 
O.l M phosphate at pH 7.0 (95:5 v/v). The absence of the cysteinyl 
component after 2.5 h (only residual bromoacetylated component 
remains) indicates essentially quantitative formation of the desired 
thioether-linked ligation product. Note the absence of disulfidc- 
linked homodimer. 

derivatives of the amphiphilic helical peptide. Thus both the 
nucleophilic peptide segment Cys-G'(K'L-E-A«L*E-G)4 and 
the electrophilic component BrAc-G-(K»L'E-A*L*E'G)4 were 
synthesized as peptide amides, and their mutual reactivity 
was investigated. The choice of pH for the reaction is critical 
since it must be sufficiently high to ionize the side-chain SH 
of Cys and render it nucleophilic. Preliminary studies had . 
revealed that ligations could not be brought about between 
cysteine-containing peptides and bromoacetylated peptides 
at low pH (T. W. Muir, M. Schnolzer. and S. B. H. Kent, 
unpublished data). However, too high a pH will also 
deprotonate the «-amino groups in lysine side chains {pK^ 

10.5), causing them to react with electrophiles such as the 
bromoacetyl group, and leading to undesired reaction products. 
Obviously a compromise must be struck between these two 
effects. Similar observations have been made by Wetzel and 
co-workers (Wetzel et al., 1990). 

Initially, the ligation reaction was performed under aqueous 
conditions (pH 7.0, O.l M phosphate, 10 mg/mL in each 
reactani) and the progress of the ligation reaction was 
monitored by analytical HPLC (see Figure 4a). Individual 
HPLC peaks were collected and examined by mass spec- 
trometry. Two chemical reactions were observed under these 
conditions. The desired ligation product (thioether-linked helix 
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dimcr) was formed in significani amounts after'only 45 min. 
However, substantial amounts of disuifide-linked homodimcr 
of the cysteinyl component also formed under these conditions. 
Formation of this oxidized species effectively protects the 
reactive sulfhydry] group, thereby substantially reducing the 
ligation yield. 

Next we sought to enhance the nucleophilic ligation reaction 
relative to the unwanted oxidation reaction. Air oxidation of 
a mercaptan to a disulfide follows a multistep mechanism 
involving sulfur radical formation (Wallace et al., 1963), 
whereas the nucleophilic attack of a thiolate ion on a primary 
alkyl halide, such as the bromoacetyl unit, occurs by an Sn2 
mechanism. Thus, the reaction rate for ligation should be 
greatly increased relative to oxidation by the use of a dipolar 
aprotic solvent. This is because of two effects: enhanced 
reactivity of the charged nucleophiiedue to reduced solvation, 
and better solvation of the charge-separated transition state 
(Parker, 1969). 

To test this hypothesis, the ligation reaction was repeated 
using a solvent system composed of 95% dimethylformamide 
(DMF) and 5% OA M phosphate at pH 7.0, the aqueous 
phosphate being added to ensure thiolate formation (Baca & 
Kent, 1993). To further disfavor dimerization of the cysteinyl 
component, the concentration of each reactant was reduced 
from 10 mg/mL to 5 mg/mL. Under these conditions the 
nucleophilic ligation reaction proceeded extremely quickly. 
Considerable amounts of product were present at the earliest 
time points examined, after less than I min of reaction. After 
2.5 h the reaction had already gone to completion, as indicated 
by the complete disappearance of the cysteinyl component 
(see Figure 4b). Importantly, absolutely no disulfide-linked 
dimer was observed. Thus, by use of the dipolar aprotic solvent 
DMF, disulfide formation can be eliminated in favor of the 
desired liaation reaction. 

Synthesis of the Model Proteins MP- 1 and MP'2. The 
synthesisof MP-2, by reaction of H-Cys-[anb]-OH with BrAc- 
[j(33]-OH, was similarly performed in 95^ DMF/5% phosphate 
(pH 7.0) using a concentration of 5 mg/mL in each reactant. 
As expected from the model ligation studies, the reaction 
proceeded virtually to completion under these conditions. The 
ligation product, MP-2, was subsequently purified by pre- 
parative HPLC and its covalent structure confirmed by ion- 
spray mass spectrometry (see Figure 5a). 

The neoprotein MP- 1 was prepared by chemical dovetailing 
of the peptides H-Cys-G-(K-L-E-A-L-E-G)4-(aiib]-OH and 
BrAc-G*(K'L-E'A-L-E-G)4-[ii33]-OH. Chemoselective ligation 
was carried out in 95% DMF/5% 0.1 M phosphate (pH 7.0) 
containing 8 M urea, using a concentration of 5 mg/mL in 
each reactant. The inclusion of urea in the mixed solvent 
system was found to greatly aid solubility of the two polypeptide 
fragments. The ligation product, MP-1, was purified to 
homogeneity by preparative HPLC and characterized by ion- 
spray mass spectrometry (see Figure 5b). Significantly, this 
ligation reaction was observed to proceed much slower than 
for the model ligation studies and for the synthesis of MP-2, 
both described above. After 12 h, the yield of product was 
approximately 60'^r. Ai this point the reaction was terminated 
since the residual bromoacetylated peptide had almost all been 
converted to the unreactive chloroacetyl adduct (as indicated 
by mass spectrometry) by chloride ions of unknown origin. In 
the model ligation studies and the synthesis of MP-2, this 
formation of chloroacetyl peptide was not a problem since the 
ligation reaction itself was extremely rapid. We have no 
explanation as to the reasons behind this unexpected sluggish 
ligation. 
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Figure 5: Characterization of the purified synthetic neoprotcins: 
(a) MP-2; (b) MP- 1 . Main panels: ion-spray mass spectra showing 
charge state distribution. Inserts: analytical reverse-phase HPLC 
spectra. Data clearly reveal that MP- 1 and MP-2 both possess the 
desired covalent structure and are highly homogeneous. 

Circular Dichroism Studies. Preliminary structural studies 
on the synthetic molecules described were performed using 
ultraviolet circular dichroism (UV CD) spectroscopy. 

{A) Cytoplasmic Tails. The far UV CD spectra of the 
cytoplasmic tails of both anb and were taken in boric acid, 
pH 7.0 at 25 "C (see Figure 6a). Significantly, in each case 
the absence of distinct minima at 208 and 222 nm revealed 
that neither cytoplasmic tail contained appreciable helical 
secondary structure in aqueous solution. However, in other 
respects the two CD spectra were strikingly different. The 
spectrum of the 47 amino acid ^3 cytoplasmic tail contained 
a strong minimum at 203-205 nm as opposed to that of the 
20 amino acid anb cytoplasmic tail which was characterized 
by a small maximum at 219-220 nm and a weak negative 
ellipticity at 235 nm (Figure 6a). 

The far-UV CD of theotub cytoplasmic tail clearly indicates 
a nonhelical structure. The spectrum has many features in 
common with classical random-coil poly(amino acids) 
(Townendetal., 1966). Theprimary sequence of the anb tail 
contains a series of six acidic amino acids in a row 
(Glu'Glu'Asp-Asp'Glu'Glu), and at pH 7.0 there is potential 
for electrostatic repulsion between neighboring carboxylates, 
perhaps accounting for this disordered structure. The 
cytoplasmic tail exhibits a far-UV CD spectrum which differs 
significantly from the classical random-coil spectrum exhibited 
by the anb cytoplasmic tail. The spectrum indicates that the 
02 cytoplasmic tail also contains little if any a-helix. Thus, 
while both isolated cytoplasmic tails are clearly nonhelical, 
these spectral differences point to the cytoplasmic tails having 
dissimilar structural properties. 

{B) HeliX'Cytoplasmic Tail Molecules. The far-UV CD 
spectra of both H-G.(K-L»E-A-L-E-G)4-[aiib]-0H(helix-aiib) 
and H-G-(K-L-E'A-L-E-G)4-[i33]-OH (helix-/?}) were mark- 
edly different from the corresponding cytoplasmic domains 



7706 Biochemistry, Vol. 33. No. 24, 1994 



Muir et al. 



a. 



10000 



sooo 



o 
£ 

E 

u 
a 

ii 

■a 



■ SOOO 



-tcoco 



-15000 ■ 



•20000 




190 



210 230 

Wavelength (nm) 



250 



b. 



5000 



o 
E 

CM 

E 
o 

« 



•scoo 



•10000 - 



•iscoc ■ 



.20000 




200 



220 



2*0 



260 



Wavelength (nm) 

Figure 6: Circular dichroism studies of synthetic polypeptides 
containing the isolated cytoplasmic domains of the integrin otub/Sj. 
(a) Isolated anb and ^3 cytoplasmic domains, (b) H-G'(K-L-E*A- 
L'E-G)4-[aiibl-0H (helix-aub) and H-G-(K-L-E-A-L-E-G)4-[/33]-OH 
(hclix-ft). 



Table 2: Percent Hclicity of Synthetic Proteins. Calculated from 


Far-UV CD Spectra Using Eqs 1 and 2 






protein 




sample 


hclicity {%) 


helix-QcIIb 


44 


1.02 


hclix-^j" 


23 


1.25 


helix dimer 


85 


1. 00 


MP- 1 


74 


1.08 



alone (see Figure 6b). Both helix-aub and helix-)33 exhibited 
bimodal UV CD spectra with minima at 208 and 222 nm, 
indicative of helical secondary structure (Hoizwarth & Doty, 
1965). The protein helicity estimated from Wzii using eq 1 
and 2 (see Experimental Procedures) was 44% for helix-anb 
and 23% for helix-j?3 (Table 2). The absence of detectable 
helicity in the individual anb and 0^ cytoplasmic tails suggests 
that the a-helix is restricted to the N-iermina! pro-(coiled 
coil) sequence in helix-otub and helix-53. The (^]222/[^]203 
ratio has previously been used to assess the number of helical 
strands within a molecule (Lau et al., 1984). A value for 
[^] 222/(^1 208 of around 0.8 is associated with single-stranded 
a-helix, whereas a value of about unity is suggestive of a two- 
stranded coiled coil. By this measure helix-ot nb with a [0] 222/ 
(5] 208 value of 1.02 would appear to contain a coilcd-coil 
structural unit. Similarly, helix-ft ( [8] ml [8] 208 = 1 .20) would 
also contain a coiled coil by this criterion. 
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Figure 7: Circular dichroism studies of the synthetic ncoprotein 
structural models MP- 1 , M P-2, and the covalent helix dimer. From 
these spectra it is clear that MP-1 and helix dimer contain high levels 
of helicity. whereas NfP-2 is essentially nonhclical. 

The moderate protein helicity exhibited, by the helix- 
[cytoplasmic tail] molecules would seem to preclude the 
intramolecular formation of a coiled-coil architecture, implying 
that the molecules exist as homodimeric coiled coils. To test 
this hypothesis, size exclusion chromatography was performed 
on both molecules. When comapred with a number of peptide 
standards of varying length, the retention volumes of helix- 
aiib (monomer = 5.4 kDa) and helix-jSj (monomer = 8.7 kDa) 
corresponded to apparent masses of approximately 9.9 and 16 
kDa, respectively. These data support the existence of ho- 
modimeric species. The putative intermolecular coiled coils 
within these homodimers presumably involve the noncovalent 
association of the two N-terminal amphiphilic helices stabilized 
by hydrophobic interactions. It is important to note that the 
amphiphilic sequences within these coiled coils can potentially 
associate in either a parallel or antiparallel arrangement, and 
that a low-resolution technique such as circular dichroism 
cannot distinguish between these two possible orientations. 

(O Helix Dimer, MP-l, and MP'2. The CD spectrum of 

the 60 amino acid Cys-G-(K-L-E-A-L-E-G)4CH2-G-(K'L-E- 
A'L*E*G)4 (helix dimer) was indicative of an a-helical coiled- 
coil protein (Figure 7). Both the estimated protein helicity 
of 85% (Table 2) and the [^]222/[^]208 value of exactly 1.00 
confirm the presence of this tertiary structure in the covalently 
linked molecule. This observation is of key importance since 
the presence of a coiled coil in such a parallel architecture is 
a fundamental design feature of the model protein MP-1. The 
far-UV CD spectrum of MP-2 reveals that this 69-residue 
molecule does not contain substantial helical secondary 
structure (Figure 7), an observation which is entirely consistent 
with the CD studies on the individual cytoplasmic tails. 

The far-UV CD spectrum of the 126-residue ncoprotein 
MP-1 contained minima at 208 and 222 nm, again indicating 
the presence of helical secondary structure (Figure 7). The 
calculated [6] 222/ [d] 208 value of 1 .08 suggests that; as expected, 
the molecule does contain an area of a-helical coiled coil. The 
retention volume of the MP- 1 (monomer = 1 4 kDa) molecule 
on gel permeation chromatography corresponded to an 
apparent mass of 18 kDa. This suggests that the molecule 
exists as a monomer in aqueous solution, implying that this 
coiled coil is intramolecular. In stark contrast to the nonhclical 
MP-2, the protein helicity of MP- 1 was estimated to be some 
74%. On the basis of the covalent structure of the MP- 1 
molecule, the expected coiled-coil region makes up only 47% 
of the amino acid content of the molecule. Thus the 74% 
observed helicity (Table 2) indicates that helicity extends 
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Figure 8: Fluorescence quenching studies as a conformational probe 
of the synthetic neoprotein structural models. Stcrn-Volmer plots 
of MP- U MP-2, isolated and isolated i3i ( I equiv) + isolated aub 
(1 equiv). F° and F refer to ih& tryptophan fluorescence in the 
absence and presence of quencher, respectively. [I*] is the concen- 
tration of iodide quencher. These data reveal a significant difference 
in the solvent exposure of the single tryptophan in MP-1 compared 
to the other molecules studied. 

beyond the expected amphiphilic coiled-coil region of the 
molecule into the cytoplasmic domains, h is significant that 
this additional helicitv was not observed for either helix-anb 
or heli.K-.J3, both of which appear to exist as homodimers 
containing coiled coils. This argues that the coiled coil unit 
is not itself sufficient to induce helicity into the cytoplasmic 
tails. Therefore, the additional helicity which is unique to the 
MP-1 molecule must be a result of the anb and 0^ tails being 
present in a staggered and parallel orientation within this 
molecule. For this reason, we sought to further define the 
structures of the ''cytoplasmic domains" of the neoprotein 
.MP-1. 

Fluorescence Quenching Studies, Quenching of protein 
fluorescence by addition of external heavy atoms such as the 
iodide ion can provide valuable information regarding the 
local structural environment of the tryptophan or tyrosine 
sidechainfluorophores(Lehrer&Leavis, 1978). Quenching 
occurs as a result of diffusion-controlled encounters of the 
heavy atom with the fluorophore and so can be related to the 
solvent exposure of the -residue. The MP- 1 neoprotein 
construct contains a single tryptophan fluorophore located 24 
residues from the C-terminai of the 3} tail. This can thus be 
used as a convenient structural probe into this area of the 
protein. 

Iodide quenching data was obtained for ,MP-1 . MP-2, the 
0} cytoplasmic tail, and the 3i tail mixed with a stoichiometric 
amount of the anb cytoplasmic tail. The data sets are presented 
as plots of versus iodide concentration, where F° and 
fare fluorescence in the absence and presence of quencher, 
respectively (Figure S). In each case the fluorescence 
quenching appears to follow the simple Stern-Volmer rela- 
tionship (eq 3), suggesting that thequenching of the tryptophan 
fluorescence is not complicated by energy transfer to tyrosine 
residues (the nearest of which is eight residues downstream). 
The resulting Stern-Volmer constants Kq, calculated from 
the graphs using eq 3, reveal a significant difference between 
MP-1 and the other molecules studied (Table 3), 

There appears to be little difference in the solvent exposure 
of the lone tryptophan of MP-2 and the cytoplasmic tail 
alone as indicated by the similar values of Kq calculated for 
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Table 3: Fluorescence Quenching Studies of the Synthetic 


Neoprotein .MP- 1 and Related Control Compounds: Stern-Volmer 


Constants {Kq) Calculated Using Eq 3 




sample 


^q(M-') 




4.3 ± 0.2 


^3 + crllb 


4.7 ± 0.3 


MP-: 


4,4 ± 0.2 


MP- 1 


0.84 ± 0.06 



each. Furthermore, addition of the anb cytoplasmic tail to 
a solution of the ^3 cytoplasmic tail has no apparent effect on 
the tryptophan quenching. The MP- 1 fluorescence, on the 
other hand, is quenched to a considerably lesser extent, 
indicating that the single tryptophan in this neoprotein 
molecule is substantially protected from solvent. In light of 
the measurements on control compounds, we think that this 
solvent shielding can be attributed to tertiary interactions 
between the aub 3} cytoplasmic tails with the formation 
of a hydrophic core involving^the tryptophan. This suggests 
that the neoprotein, MP-l, has a defined tertiary structure. 

It is important to mention that two slightly different 
sequences for the cytoplasmic domain of anb are present in 
the literature. All the studies reported above refer to the aub 
sequence shown in Figure 2. However, model proteins 
containing the other anb cytoplasmic tail sequence were also 
constructed using the same procedure as before. Note that 
this variant differs at only two positions, His''^ and Thr'^' 
(Prandini ei al., 1988) to Pro^^^ and Pro999 (Poncz et al., 
1987). These molecules were found to possess essentially 
identical structural properties to the above when studied by 
both circular dichroism and fluorescence spectroscopies. 

SIGNIFICANCE 

In summary, these preliminary structural results suggest 
that the synthetic protein MP-1 is a promising soluble model 
for the cytoplasmic domains of the heterodimeric receptor, 
integrin a\i\ii3y MP-1 has a defined tertiary structure, with 
unique features present only in the model protein, namely, the 
coiled-coil template mimics the transmembrane helices of the 
two subunits, at the same time constraining the cytoplasmic 
tails in a staggered, parallel and proximal topology. Further 
studies are now underway to determine the structure and 
biological activity of the MP- 1 construct through high- 
resolution nuclear magnetic resonance (N MR) spectroscopy 
techniques and a range of functional assays. 

The ability to study a putative functional model of the 
cytoplasmic domains of integrin anb^s in non membrane bound 
conditions will allow systematic exploration of the structural 
origins of biological effects. The use of chemically synthesized 
neoprotein structural models of nonlinear architecture to mimic 
complex membrane receptors may be a useful approach to 
study signaling processes in multisubunit transmembrane 
receptors. 
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